M L A G G V R 7 

SASGCCAGICrOTTWCCTCCCAAOGCea^ ATG CTG GCG GGG GGC GTG AGG 151 

SMPSPLLACWQPILLLVLGS 27 

AGC ATG CCC AGC CCC CTC CTG GCC TGC TGG CAG CCC ATC CTC CTG CTG GTG CTG GGC TCA 211 

VLSGSATGCPPRCECSAQDR 47 

GTG CTG TCA GGC TCG GCC ACG GGC TGC CCG CCC CGC TGC GAG TGC TCC GCC CAG GAC CGC 271 

AVLCHRKRFVAVPEGIPTET 67 

GCT GTG CTG TGC CAC CGC AAG CGC TTT GTG GCA GTC CCC GAG GGC ATC CCC ACC GAG ACG 331 

RLLDLGKNRIKTLNQDEFAS 87 

CGC CTG CTG GAC CTA GGC AAG AAC CGC ATC AAA ACG CTC AAC CAG GAC GAG TTC GCC AGC 391 

FPHLEELELNENIVSAVEPG 107 

TTC CCG CAC CTG GAG GAG CTG GAG CTC AAC GAG AAC ATC GTG AGC GCC GTG GAG CCC GGC 451 

AFNNLFNLRTLGLRSNRLKL 127 

GCC TTC AAC AAC CTC TTC AAC CTC CGG ACG CTG GCT CTC CGC AGC AAC CGC CTG AAG CTC 511 

IPLGVFTGLSNLTKLDTREN 147 

ATC CCG CTA GGC GTC TTC ACT GGC CTC AGC AAC CTG ACC AAG CTG GAC ACG AGG GAG AAC 571 

KIVILLDYMFQDLYNLKSLE 167 

AAG ATC GTT ATC CTA CTG GAC TAC ATG TTT CAG GAC CTG TAC AAC CTC AAG TCA CTG GAG 631 

VGDNDLVYISHRAFSGLNSL 187 

GTT GGC GAC AAT GAC CTC GTC TAC ATC TCT CAC CCC GCC TTC AGC GGC CTC AAC AGC CTG 691 

EQLTLEKCNLTSIPTEALSH 207 

GAG CAG CTG ACT CTG GAG AAA TGC AAC CTG ACC TCC ATC CCC ACC GAG GCG CTG TCC CAC 751 

LHGLIVLRLRHLNINAIRDY 227 

CTG CAC GGC CTC ATC GTC CTG AGG CTC CGG CAC CTC AAC ATC AAT GCC ATC CGG GAC TAC 811 

SFKRLYRLKVLEISHWPYLD 247 

TCC TTC AAG AGG CTG TAC CGA CTC AAG GTC TTG GAG ATC TCC CAC TGG CCC TAC TTG GAC 871 

TMTPNCLYGLNLTSLSITHC 267 

ACC ATG ACA CCC AAC TGC CTC TAC GGC CTC AAC CTG ACG TCC CTG TCC ATC ACA CAC TGC 931 

NLTAVPYLAVRHLVYLRFLN 287 

AAT CTG ACC GCT GTG CCC TAC CTG GCC GTC CGC CAC CTA GTC TAT CTC CGC TTC CTC AAC 991 

LSYNPISTIEGSMLHELLRL 307 

CTC TCC TAC AAC CCC ATC AGC ACC ATT GAG GGC TCC ATG TTG CAT GAG CTG CTC CGG CTG 1051 

QEIQLVGGQLAVVEPYAFRG 327 

CAG GAG ATC CAG CTG GTG GGC GGG CAG CTG GCC GTG GTG GAG CCC TAT GCC TTC CGC GGC 1111 

LNYLRVLNVSGNQLTTLEES 347 

CTC AAC TAC CTG CGC GTG CTC AAT GTC TCT GGC AAC CAG CTG ACC ACA CTG GAG GAA TCA 1171 

VFHSVGNLETLILDSNPLAC 367 

GTC TTC CAC TCG GTG GGC AAC ..G GAG ACA CTC ATC CTG GAC TCC AAC CCG CTG GCC TGC 1231 

DCRLLWVFRRRWRLNFNRQQ 387 

GAC TCT CGG CTC CTG TGG GTG ... ... CGC CGC TGG CGG CTC AAC TTC AAC CGG CAG CAG 1291 

PTCATPEFVQGKEFKDFPDV 407 

CCC ACG TGC GCC ACG CCC GAG TTT GTC CAG GGC AAG GAG TTC AAG GAC TTC CCT GAT GTG 1351 

LLPNYFTCRRARIRDRKAQQ 427 

CTA CTG CCC AAC TAC TTC ACC TGC CGC CGC GCC CGC ATC CGG GAC CGC AAG GCC CAG CAG 1411 

VFVDEGHTVQFVCRADGDPP 447 

GTG TTT GTG GAC GAG GGC CAC ACG GTG CAG TTT GTG TGC CGG GCC GAT GGC GAC CCG CCG 1471 

PAILWLSPRKHLVSAKSNGR 467 

CCC GCC ATC CTC TGG CTC TCA CCC CGA AAG CAC CTG GTC TCA GCC AAG AGC AAT GGG CGG 1531 

LTVFPDGTLEVRYAQVQDNG 487 

CTC ACA GTC TTC CCT GAT GGC ACG CTG GAG GTG CGC TAC GCC CAG GTA CAG GAC AAC GGC 1591 

TYLCIAANAGGNDSMPAHLH 507 

ACG TAC CTG TGC ATC GCG GCC AAC GCG GGC GGC AAC GAC TCC ATG CCC GCC CAC CTG CAT 1651 

VRSYSPDWPHQPNKTFAFIS 527 

GTG CGC AGC TAC TCG CCC GAC TGG CCC CAT CAG CCC AAC AAG ACC TTC GCT TTC ATC TCC 1711 

NQPGEGEANSTRATVPFPFD 547 

AAC GAG CCG GGC GAG GGA GAG GCC AAC AGC ACC CGC GCC ACT GTG CCT TTC CCC TTC GAC 1771 

IKTLIIATTMGFISFLGVVL 567 

ATC AAG ACC CTC ATC ATC GCC ACC ACC ATG GGC TTC ATC TCT TTC CTG GGC GTC GTC CTC 1831 

FCLVLLFLWSRGKGNTKHNI 587 

TTC TGC CTG GTG CTG CTG TTT CTC TGG AGC CGG GGC AAG GGC AAC ACA AAG CAC AAC ATC 1891 

EIEYVPRKSDAGISSADAPR 607 

GAG ATC GAG TAT GTG CCC CGA AAG TCG GAC GGA GGC ATC AGC TCC GCC GAC GCG CCC CGC 1951 

KFNMKMI * 615 

AAG TTC AAC ATG AAG ATG ATA TGA 1975 



TACCAGGACCTCAGAAGCCCAGACCTGGGGACCCCACCTAC^ 
GCTTTCAATAACTATOGA ITITIA TGAAAACTTGAAA 



2054 
2133 
2212 
2291 
2351 
Zr&uz file T81; Output File T811.?at 
Sequence length 979 

M A G S P 5 

GAATTC CG CACGAGGCCAGCCAGTCCGCCSGVMCGRRG^ ATG GCG GGG TCG CCG 72 

LLWGPRAGGVGLLVLLLLGL 25 

CTG CTC TGG GGG CCG CGG GCC GGG GGC GTC CGC OTP TTG GTG CTG CTG CTG CTC GGC CTG 132 

FRPPPALCARPVKEPRGLSA 45 

TTT CGG CCG CCC CCC GCG CTC TGC GCG CGG CCG GTA AAG GAG CCC CGC GGC CTA AGC GCA 192 

ASPPLAETGAPRRFRRSVPR 65 

GCG TCT CCG CCC TTG GCT GAG ACT GGC GCT CCT CGC CGC TTC CGG CGG TCA GTG CCC CGA 252 

GEAAGAVQELARALAHLLEA 85 

GGT GAG GCG GCG GGG GCG GTG CAG GAG CTG GCG CGG GCG CTG GCG CAT CTG CTG GAG GCC 312 

ERQERARAEAQEAEDQQARV 105 

GAA CGT CAG GAG CGG GCG CGG GCC GAG GCG CAG GAG GCT GAG GAT CAG CAG GCG CGC GTC 372 

LAQLLRVWGAPRNSDPALGL 125 

CTG GCG CAG CTG CTG CGC GTC TGG GGC GCC CCC CGC AAC TCT GAT CCG GCT CTG GGC TTG 432 

DDDPDAPAAQLARALLRARL 145 

GAC GAC GAC CCC GAC GCG CCT GCA GCG CAG CTC GCT CGC GCT CTG CTC CGC GCC CGC CTT 492 

DPAALAAQLVPAPVPAAALR 165 

GAC CCT GCC GCC CTA GCA GCC CAG CTT CTC CCC GCG CCC CTC CCC GCC GCG GCG CTC CGA 552 

PRPPVYDDGPAGPDAEEAGD 185 

CCC CGG CCC CCG GTC TAC GAC GAC GGC CCC GCG GGC CCG GAT GCT GAG GAG GCA GGC GAC 612 

ETPDVDPELLRYLLGRILAG 205 

GAG ACA CCC GAC GTG GAC CCC GAG CTG TTG AGG TAC TTG CTG GGA CGG ATT CTT GCG GGA 672 

SADSEGVAAPRRLRRAADHD 225 

AGC GCG GAC TCC GAG GGG GTG GCA GCC CCG CGC CGC CTC CGC CGT GCC GCC GAC CAC GAT 732 

VGSELPPEGVLGALLRVKRL 245 

GTG GGC TCT GAG CTG CCC CCT GAG GGC GTG CTG GGG GCG CTG CTG CCT GTG AAA CGC CTA 792 

ETPAPQVPARRLLPP * 261 

GAG ACC CCG GCG CCC CAG GTG CCT GCA CGC CGC CTC TTG CCA CCC TGA 840 

GCA C rGCCCO G AlCC C GT GC A C CCTQ GG ACCCAGAA CT 519 

CTCGaSAXAACTTACCCCGGCCAGCCAGCCCTCTC 
Percent Similarity : r/: 29.412 

T79 1 MLAGC -wiCWQPILLLVLGSVLSGS. . ATGCPPRCECSAQDR. 47 

: || |.:| |::.|: .:| ..:||. | |. ..: 

D45913 1 MARLSTGKAAC . QWLGLLITSLTESSILTSECPQLCVCEIRPWF 44 

T79 48 AVLCHRKRFVAVPEGIPTETRLLDLGKNRIKTLNQDEFAS 87 

D45913 45 TPQSTYREATTVDCNDLRLTRIPGNLSSDTQVLLLQSNNI 84 

• • < • • 

T79 88 FPHLEELELNENIVSAVEPGAFNNLFNLRTLGLRSNRLKLIPLGVFTGLS 137 

D45913 85 AKTVDELQQLFNLTELDFSQNNFTNIKEVGLANLT 119 

T79 138 NLTKLDTRENKIVILLDYMFQDLYNLKSLEVGDNDLVYISHRAFSGLNSL 187 

-II. I. II. I : I I :||| II.. I :.-|:: II • i I I I I - • I 
D45913 120 QLTTLHLEENQISEMTDYCLQDLSNLQELYINHNQISTISAKAFSGLKNL 169 

T79 188 EQLTLEKCNLTSIPTEALSHLHGLIVLRLRHLNINAIRDYSFKRLYRLKV 237 

i -I h. -I- I-. ..| :|.s • : :| I .|:.| .1: 

D45913 170 LRIiil^SNKUCVIDSRWFDSTPNLEII^IGm^ 219 

• • • . . 

T79 238 LEISHWPYLDTMTPNCLYGLN . LTSLSITHCNLTAVPYLAVRHLVYLRFL 286 

™ I II-.:.. I- I lh I. Ill: • -I- 11 Ih-.:. Ml 

D45913 220 LVLAG . MYLTDVPGNALVGLDSLESLSFYDNKLIKVPQLALQK7PMLKFL 268 

. . . • • 
T79 287 NLSYNPISTIEGSMLHELLRLQEIQLVG.GQLAVVEPY 323 

L :|. HI -I::: :-::M|.|:.: . hi- h.l 

D45913 269 DLNKNPIHKIQEGDFKNMLRLKELGINNMGELVSVDRYALDNLPELTKLE 318 

T79 324 AFRGLNYLRVLNVSGNQLTTLEESVFHSVGNLETLIL 360 

^3 llh: I I : . . | . | . . : : I : • I I • : ' 

D45913 319 ATNNPKLSYIHRIJIFRSVPALESLMLNNNALNAVYQKTVESLPNIilEISI 368 

T79 361 DSNPLACDCRLLWVFRRRWRLNFNRQQPT.CATPEFVQGKEFKDFPDVLL 409 

.1111 III : h .-: -:.| - - II I * -I-:. I HI* 
D45913 369 HSNPLRCDC7IHWINSNKTNIRFMEPLSMFCAMPPEYRGQQVK. . .EVLI 415 

a . • • • 

T79 410 PNYFT . CRRARIRDRKAQQVFVDEGHTVQFVCRADGDPPPAILWLSPRKH 458 

.: . | . :| :•:: :| I |: : III ::|.|.|.|:.| : 
D45913 416 QDSSEQCLPMISHDTFPNHLNMDIGTTLFLDCRAMAEPEPEIYWVTPIGN 465 

• • • • • 

T79 459 LVSAKS . NGRLTVFPDGTLEVRYAQVQDNGTYLCIAANAGGNDSMPAHLH 507 

: -Hill: I = = I - I I I : I • I • • I • I • • • I 

D45913 466 KITVETLSDKYKLSSEGTLEIANIQIEDSGRYTCVAQNVQGADTRVATIK 515 

• . . • * 

T79 508 V RSYSPDWPHQ 518 

I : I 

D45913 516 VNGTLIXGAQVI.KIYVKQTESHS ILVSWKVN^ 565 

T79 519 PNKTF AFI SNQ PGEGEANSTRA 540 
D45913 566 PHITYTARVPVD^BpjLXHLQPSTDYEVCLTVSNIHQQTQKSO^BTC 615 

T79 541 TVPFPFDIKTLIIATTMGFI..SFLGVVLFCLVLLFLWSRGKGNTKHNIE 588 

I -I I I . . | .:: : | ::: | : :. : : : : . | | . . | . 
D45913 616 TAAFALDISDHETSTALAAVMGSMFAVISLASIAIYIAKRFKRKNVHHSL 665 

T79 589 IEYVPRKSDAGISSADAPRKFNMKMI 614 

D45913 666 KKYMQKTSSIPLNEL. YPPLINLWEADSDKDKDGSADTKPTQVDTSRSYY 714 
FIG. 4 




TRPILVIHDEQKGPEVTSN 19 

CC ACG CGT CCS ATC TO GTC ATC CAC GAT GAA CAG AAG GGG CCC GAA GTG ACC TCC AAT 59 

AALTLRNFCNWQKQHNPPSD 39 

GCT GCC CTC ACT CTG CGG AAC TTT TGC AAC TGG CAG AAG CAG CAC AAC CCA CCC ACT GAC 119 

RDAEHYDTAILFTRQDLCGS 59 

CGG GAT GCA GAG CAC TAT GAC ACA GCA ATT CTT TTC ACC AGA CAG GAC TTG TGT GGG TCC 179 

QTCDTLGMADVGTVCDPSRS 79 

CAG ACA TGT GAT ACT CTT GGG ATG GCT GAT GTT GGA ACT GTG TGT GAT CCG AGC AGA AGC 239 

CSVIEDDGLQAAFTTAHELG 99 

TGC TCC GTC ATA GAA GAT GAT GGT TTA CAA GCT GCC TTC ACC ACA GCC CAT GAA TTA GGC 299 

HVFNMPHDDAKQCASLNGVN 119 

CAC GTG TTT AAC ATG CCA CAT GAT GAT GCA AAG CAG TGT GCC AGC CTT AAT GGT GTG AAC 359 

QDSHMMASMLSNLDHSQPWS 139 

CAG GAT TCC CAC ATG ATG GCG TCA ATG CTT TCC AAC CTG GAC CAC AGC CAG CCT TGG TCT 419 

PCSAYMITSFLDNGHGECLM 159 

CCT TGC ACT GCC TAC ATG ATT ACA TCA TTT CTG GAT AAT GGT CAT GGG GAA TGT TTG ATG 479 

DKPQNPIQLPGDLPGTSYDA 179 

GAC AAG CCT CAG AAT CCC ATA CAG CTC CCA GGC GAT CTC CCT GGC ACC TCG TAC GAT GCC 539 

NRQCQFTFGEDSKHCPDAAS 199 

AAC CGG CAG TGC CAG TTT ACA TTT GGG GAG GAC TCC AAA CAC TGC CCT GAT GCA GCC AGC 599 

TCSTLWCTGTSGGVLVCQTK 219 

ACA TGT AGC ACC TTG TGG TGT ACC GGC ACC TCT GGT GGG GTG CTG GTG TGT CAA ACC AAA 659 

HFPWADGTSCGEGKWCINGK 239 

CAC TTC CCG TGG GCG GAT GGC ACC AGC TGT GGA GAA GGG AAA TGG TGT ATC AAC GGC AAG 719 

CVNKTDRKHFDTPFHGSWGM 259 

TGT GTG AAC AAA ACC GAC AGA AAG CAT TTT GAT ACG CCT TTT CAT GGA AGC TGG GGA ATG 779 

WGPWGDCSRTCGGGVQYTMR 279 

TGG GGG CCT TGG GGA GAC TGT TCG AGA ACG TGC GGT GGA GGA GTC CAG TAC ACG ATG AGG 839 

ECDNPVPKNGGKYCEGKRVR 299 

GAA TGT GAC AAC CCA GTC CCA AAG AAT GGA GGG AAG TAC TGT GAA GGC AAA CGA GTG CGC 899 

YRSCNLEDCPDNNGKTFREE 319 

TAC AGA TCC TGT AAC CTT GAG GAC TGT CCA GAC AAT AAT GGA AAA ACC TTT AGA GAG GAA 959 

QCEAHNEFSKASFGSGPAVE 339 

CAA TGT GAA GCA CAC AAC GAG TTT TCA AAA GCT TCC TTT GGG ACT GGG CCT GCG GTG GAA 1019 

WIPKYAGVSPKDRCKLICQA 359 

TGG ATT CCC AAG TAC GCT GGC GTC TCA CCA AAG GAC AGG TGC AAG CTC ATC TGC CAA GCC 1079 

KGIGYFFVLQPKVVDGTPCS 379 

AAA GGC ATT GGC TAC TTC TTC GTT TTG CAG CCC AAG GTT GTA GAT GGT ACT CCA TGT AGC 1139 

PDSTSVCVQGQCVKAGCDRI 399 

CCA GAT TCC ACC TCT GTC TGT GTG CAA GGA CAG TGT GTA AAA GCT GGT TGT GAT CGC ATC 1199 

IDSKKKFDKCGVCGGNGSTC 419 

ATA GAC TCT AAA AAG AAG TTT GAT AAA TGT GGT GTT TGC GGG GGA AAT GGA TCT ACT TGT 1259 

KKISGSVTSAKPGYHDIITI 439 

AAA AAA ATA TCA GGA TCA GTT ACT AGT GCA AAA CCT GGA TAT CAT GAT ATC ATC AGA ATT 1319 

PTGATNIEVKQRNQRGSRNN 459 

CCA ACT GGA GCC ACC AAC ATC GAA GTG AAA CAG CGG AAC CAG AGG GGA TCC AGG AAC AAT 1379 

GSFLAIKAADGTYILNGDYT 479 

GGC AGC TTT CTT GCC ATC AAA GCT GCT GAT GGC ACA TAT ATT CTT AAT GGT GAC TAC ACT 1439 

LSTLEQDIMYKGVVLRYSGS 499 

TTG TCC ACC TTA GAG CAA GAC ATT ATG TAC AAA GGT GTT GTC TTG AGG TAC AGC GGC TCC 1499 

SAALERIRSFSPLKEPLTIQ 519 

TCT GCG GCA TTG GAA AGA ATT CGC AGC TTT AGC CCT CTC AAA GAG CCC TTG ACC ATC CAG 1559 

VLTVGNALRPKIKYTYFVKK 539 

GTT CTT ACT GTG GGC AAT GCC CTT CGA CCT AAA ATT AAA TAC ACC TAC TTC GTA AAG AAG 1619 

KKESFNAIPTFSAWVIEEWG 559 

AAG AAG GAA TCT TTC AAT GCT ATC CCC ACT TTT TCA GCA TGG GTC ATT GAA GAG TGG GGC 1679 

ECSKTCGKGYKKRSLKCLSH 579 

GAA TGT TCT AAG ACC TGT GGG AAG GGT TAC AAA AAA AGA AGC TTG AAG TGT CTG TCC CAT 1739 

DGGVLSHESCDPLKKPKHFI 599 

GAX GGA GGG GTG TTA TCT CAT GAG AGC TGT GAT CCT TTA AAG AAA CCT AAA CAT TTC ATA 1799 

DFCTMAECS * 609 

GAC TTT TGC ACA ATG GCA GAA TGC AGT TAA 1829 

GTGGTTTO 

3AAGGCTGGAGGG 

ATCCAGCGTATCTTGCCAGTAACCAGTGAGGTGTATC 

CATCAGAGTAAACTOCCAGTTGCAAATITGATAGGAT^ 

AT3UUUS a^ XG GGC ATTATTATT m 

AAAAGTTAGAACTRrrACAACCCCTGTTTCCTGGTAC^ 

AGGAGAAAAGXGAGATITTACTAAa 

TCXrXGftOacaUCXGTrrA^ 

GATICTICAACAIGAGAGAAAGGCTCAGCAAOGTnAAAI 
GTCTTTArrTGTGTAATTCA^ 

AAAAGCAGTGAAGCATrAGATGCTGGrAAAAGCTAGAGG 
TACCATOIAA UJJlU-ri ' lUia AATATGGAIGT 

1908 

1987 

2066 

2145 

2224 

2303 

2382 

2461 

2540 

2619 

2698 
2777 

ATTTCTAATGAAAAAAACTTTTAAATTATA 3093 

Comparison (90% Protein Sequence Identity) 

Mll-'ll 

1 TRPILVIHDEQ 11 

301 KGPEVTSNAALTLIlNPCSWQKQHNSPSnRDPHfi^^ 350 

Mllllllllllllll|.||||l|.|lllhllllllllllllllllhl , 

12 KGPEVTSNAALTLRNFCNWQKQHNPPSDRDAEHYDTAILFTRQDLCGSQT 61 

. • * 

351 CCTLQlMMTrVCTPSRSCSra 400 

illliliiilliiiiiiiiiillilllllllllllliliinillllli : 

62 CDTLGMADVGTVCDPSRSCSVIEDDGLQAAFTTAHELGHVFNMPHDDAKQ 111 

401 r^jarmrm^aj cmt jg<n\Tra^ PWSPe^VMin > SFIiaK^^ 450 

lllllll- -111:111111.1111111111111101111 1111111 111 
112 CASLNGVNQDSHMMASMLSNLDHSQPWSPCSAYMITSFLDNGHGECLMDK 161 

• • • • * 

451 PQNPIKI^SBLPCTLYDANRULXJ^ 500 

iiiii-ihiiiii 1 1 1 1 1 1 1 1 1 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 1 - 1 1 > JLiJJJLL 

162 PQNPIQLPGDLPGTSYDANRQCQFTFGEDSKHCPDAASTCSTLWCTGTSG 211 

• • • • " 

501 e^ VLvmiFw a i x;r sL X^ ^ 550 
HIlMllllllllllllMllh-lllllllM IMH lMI N 

212 GVLVCQTKHFPWADGTSCGEGKWCINGKCVNKTDRKHFDTPFHGSWGMWG 261 
551 PWGDCSRICGC33VQYTMRBCiaiPVP!TC 600 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIMI 

* 

1 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 - 1 - 1 ! I - 1 M I i M 1 1 1 1 1 1 1 - 1 H i I 

312 NGKTFREEQCEAHNEFSKASFGSGPAVEWIPKYAGVSPKDRCKLICQAKG 361 
651 ICTFFVLQPKV V lXri' P C SPD^ 700 

IllMlllililllllllMllllllllllllllllllll lllMlllll 

362 IGYFFVLQPKVVDGTPCSPDSTSVCVQGQCVKAGCDRIIDSKKKFDKCGV 411 

. . • • • 

701 CGQJGSU UU U ISU XVTSTRPCra 750 

iiimiiihii iii.:iiiiihiii-mmihiiiimiiii 

412 CGGNGSTCKKISGSVTSAKPGYHDIITIPTGATNIEVKQRNQRGSRNNGS 461 

• • • • * 

751 FIAIRAADGT¥TLNGNFTLSTI£Q^ 800 
lllhlllllllll|::||IIIIII: I I I - 1 I I 1 1 1 I 1 1 1 1 I 1 1 1 1 I I I 

462 FLAIKAADGTYILNGDYTLSTLEQDIMYKGVVLRYSGSSAALERIRSFSP 511 

. . • • • 

801 rjEPLTioviiiraiMiiPiara^ 850 

llllllllll IhllllllhllhllMIIIIIIMMIIIIIM 
512 IiCEPLTXQVLTVQIMilPKIKrr^ 560 



901 WSPCSglO SKUXKiUa^ KCySHIX^^ 950 

illinium. iiHiiimHmii iiiiHiiii^ i 

561 . . . cailU^XiUU<SI3CCLS^^ IDf CmREC 607 



951 S* 951 

I 

608 S* 609 
gtgcctac atg gtc acg tcc ttc cta gat aat gga cac ggg gaa tgt ttg 50 
Met Val Thr Ser Phe Leu Asp Asn Gly His Gly Glu Cys Leu 

1 5 10 

atg gac aag ccc cag aat cca atc aag ctc cct tct gat ctt ccc ggt 98 

Met Asp Lys Pro Gln Asn Pro Ile Lys Leu Pro Ser Asp Leu Pro Gly 

15 20 25 30 

acc ttg tac gat gcc aac cgc cag tgt cag ttt aca ttc gga gag gaa 146 

Thr Leu Tyr Asp Ala Asn Arg Gln Cys Gln Phe Thr Phe Gly Glu Glu 

35 40 45 

tcc aag cac tgc cct gat gca gcc agc aca tgt act acc ctg tgg tgc 194 

Ser Lys His Cys Pro Asp Ala Ala Ser Thr Cys Thr Thr Leu Trp Cys 

50 55 60 

act ggc acc tcc ggt ggc tta ctg gtg tgc caa aca aaa cac ttc cct 242 

Thr Gly Thr Ser Gly Gly Leu Leu Val Cys Gln Thr Lys His Phe Pro 

65 70 75 

tgg gca gat ggc acc agc tgt gga gaa ggg aag tgg tgt gtc agt ggc 290 

Trp Ala Asp Gly Thr Ser Cys Gly Glu Gly Lys Trp Cys Val Ser Gly 

80 85 90 

aa? tgc gtg aac aag aca gac atg aag cat ttt gct act cct gtt cat 338 

??? Cys Val Asn Lys Thr Asp Met Lys His Phe Ala Thr Pro Val His 

95 100 105 110 

gga agc tgg gga cca tgg gga ccg tgg gga gac tgc tca aga acc tgt 386 

Gly Ser Trp Gly Pro Trp Gly Pro Trp Gly Asp Cys Ser Arg Thr Cys 

115 120 125 

ggg ggt gga gtt caa tac aca atg aga gaa tgt gac aac cca gtc cca 434 

Gly Gly Gly Val Gln Tyr Thr Met Arg Glu Cys Asp Asn Pro Val Pro 

130 135 140 

aag aac gga ggg aag tac tgt gaa ggc aaa cga gtc cgc tac agg tcc 482 

Lys Asn Gly Gly Lys Tyr Cys Glu Gly Lys Arg Val Arg Tyr Arg Ser 

145 150 155 

tgt aac atc gag gac tgt cca gac aat aac gga aaa acg ttc aga gag 530 

Cys Asn Ile Glu Asp Cys Pro Asp Asn Asn Gly Lys Thr Phe Arg Glu 

160 165 170 

gag cag tgc gag gcg cac aat gag ttt tcc aaa gct tcc ttt ggg aat 578 

Glu Gln Cys Glu Ala His Asn Glu Phe Ser Lys Ala Ser Phe Gly Asn 

175 180 185 190 

gag ccc act gta gag tgg aca ccc aag tac gcc ggc gtc tcg cca aag 626 

Glu Pro Thr Val Glu Trp Thr Pro Lys Tyr Ala Gly Val Ser Pro Lys 

195 200 205 
gac agg tgc aag ctc acc tgt gaa gcc aaa ggc att ggc tac ttt ttc 674 
Asp Arg Cys Lys Leu Thr Cys Glu Ala Lys Gly Ile Gly Tyr Phe Phe 
210 215 220 

gtc tta cag ccc aag gtt gta gat ggc act ccc tgt agt cca gac tct 722 
Val Leu Gln Pro Lys Val Val Asp Gly Thr Pro Cys Ser Pro Asp Ser 
225 230 235 

acc tct gtc tgt gtg caa ggg cag tgt gtg aaa gct ggc tgt gat cgc 770 
Thr Ser Val Cys Val Gln Gly Gln Cys Val Lys Ala Gly Cys Asp Arg 
240 245 250 

atc ata gac tcc aaa aag aag ttt gat aag tgt ggc gtt tgt gga gga 818 
Ile Ile Asp Ser Lys Lys Lys Phe Asp Lys Cys Gly Val Cys Gly Gly 
255 260 265 270 

aac ggt tcc aca tgc aag aag atg tca gga ata gtc act agt aca aga 866 
Asn Gly Ser Thr Cys Lys Lys Met Ser Gly Ile Val Thr Ser Thr Arg 
275 280 285 

cc? ggg tat cat gac att gtc aca att cct gct gga gcc acc aac att 914 
Pro Gly Tyr His Asp Ile Val Thr Ile Pro Ala Gly Ala Thr Asn Ile 
290 295 300 

ga? gtg aaa cat cgg aat caa agg ggg tcc aga aac aat ggc agc ttt 962 
Glu Val Lys His Arg Asn Gln Arg Gly Ser Arg Asn Asn Gly Ser Phe 
305 310 315 

ctg gct att aga gcc gct gat ggt acc tat att ctg aat gga aac ttc 1010 
Leu Ala Ile Arg Ala Ala Asp Gly Thr Tyr Ile Leu Asn Gly Asn Phe 
320 325 330 

ac? ctg tcc aca cta gag caa gac ctc acc tac aaa ggt act gtc tta 1058 
Thr Leu Ser Thr Leu Glu Gln Asp Leu Thr Tyr Lys Gly Thr Val Leu 
335 340 345 350 

agg tac agt ggt tcc tcg gct gcg ctg gaa aga atc cgc agc ttt agt 1106 
Arg Tyr Ser Gly Ser Ser Ala Ala Leu Glu Arg Ile Arg Ser Phe Ser 
355 360 365 

cca ctc aaa gaa ccc tta acc atc cag gtt ctt atg gta ggc cat gct 1154 
Pro Leu Lys Glu Pro Leu Thr Ile Gln Val Leu Met Val Gly His Ala 
370 375 380 

ctc cga ccc aaa att aaa ttc acc tac ttt atg aag aag aag aca gag 1202 
Leu Arg Pro Lys Ile Lys Phe Thr Tyr Phe Met Lys Lys Lys Thr Glu 
385 390 395 

tca ttc aac gcc att ccc aca ttt tct gag tgg gtg att gaa gag tgg 1250 
Ser Phe Asn Ala Ile Pro Thr Phe Ser Glu Trp Val Ile Glu Glu Trp 
400 405 410 

ggg gag tgc tcc aag aca tgc ggc tca ggt tgg cag aga aga gta gtg 1298 
Gly Glu Cys Ser Lys Thr Cys Gly Ser Gly Trp Gln Arg Arg Val Val 
415 420 425 430 

cag tgc aga gac att aac gga cac cct gct tcc gaa tgt gca aag gaa 1346 
Gln Cys Arg Asp Ile Asn Gly His Pro Ala Ser Glu Cys Ala Lys Glu 
435 440 445 

gtg aag cca gcc agt acc aga cct tgt gca gac ctt cct tgc cca cac 1394 
Val Lys Pro Ala Ser Thr Arg Pro Cys Ala Asp Leu Pro Cys Pro His 
450 455 460 

tgg cag gtg ggg gat tgg tca cca tgt tcc aaa act tgc ggg aag ggt 1442 
Trp Gln Val Gly Asp Trp Ser Pro Cys Ser Lys Thr Cys Gly Lys Gly 
465 470 475 

tac aag aag aga acc ttg aaa tgt gtg tcc cac gat ggg ggc gtg tta 1490 
Tyr Lys Lys Arg Thr Leu Lys Cys Val Ser His Asp Gly Gly Val Leu 
480 485 490 

tca aat gag agc tgt gat cct ttg aag aag cca aag cat tac att gac 1538 
Ser Asn Glu Ser Cys Asp Pro Leu Lys Lys Pro Lys His Tyr Ile Asp 
495 500 505 510 

tt? tgc aca ctg aca cag tgc agt taagaggcgt tagaggacaa ggtagcgtgg 1592 
Phe Cys Thr Leu Thr Gln Cys Ser 

515 

99fpgggctg atacactgag tgcaagagta ctggagggat ccagtgagtc aaaccagtaa 1652 
gqagtgaggt gtggcaagga ggtgtgtgta ggggatacat agcaaaggag gtagatcagg 1712 
ao^aptaccct gccagttaca ttctgataag gtagttaatg aggcacagta gcatctgaaa 1772 
ga^catacag agcactaagg agccccaaag cactattagt atctcttttc ttatatctat 1832 
c^qecaaata attttcagag tctggcagaa gccctgttgc actgtactaa ctagatactt 1892 
cfgatcacaa agattgggaa aggcaaagca gaaagatggt aagactgggt ttcaaacaag 1952 
9<SCtggtttc aatcactgga ggcaaggagg aggggacaaa caagatcatt attcgaagtc 2012 
gdtfggttgct gtggttttac ggaaggttga tgcatcattc ctatcaacag tgaaaagttc 2072 
adjcttgttca acgtgacaga aaggctcatc tccgtgaaag agctcctgat ttcttcttac 2132 
accatctcag ttcttaacta tagttcatgt tgaggtagaa acaattcatc tatttataaa 2192 
atgtacattg gaaaaaaaaa gtgaagttta tgaggtacac ataaaaactg aaggaaacaa 2252 
tgagcaacat gcctcctgct ttgcttcctc ctgaggtaaa cctgcctggg gattgaggtt 2312 
gtttaagatt atccatggct cacaagaggc agtaaaataa tacatgttgt gccagagtta 2372 
gaatggggta tagagatcag ggtcccatga gatggggaac atggtgatca ctcatctcac 2432 
atgggaggct gctgcagggt agcaggtcca ctcctggcag ctggtccaac agtcgtatcc 2492 
tggtgaatgt ctgttcagct cttctactga gagagaatat gactgtttcc atatgtatat 2552 
gtatatagta aaatatgtta ctatgaattg catgtacttt ataagtattg gtgtgtctgt 2612 
tccttctaag aaggactata gtttataata aatgcctata ataacatatt tatttttata 2672 
catttatttc taatgataaa acctttaagt tatatcgctt ttgtaaaagt gcatataaaa 2732 
atagagtatt tatacaatat atgttaacta gaaataataa aagaacactt ttgaatgtgt 2792 
atgcctattt tctggagtgg gattaacttc tgggcaagaa atctgatgag acacaaacat 2852 
tggacttcaa gacagtttta aattttgggt aaatgaactg tatttcctgt ttatagacgt 2912 
actaataaaa aagaagttga tgatgtcttt agtggtaaga ttgttactaa tgtggttggc 2972 
aaattgctgt aaagagccag atagtaagca tttatggcat tgtaggctat ctttcctgcc 3032 
acaaccatgt gacagtgagt gctttgtagg actgagagca gccataaatg acatgtaaat 3092 
gataaactgt ggctgtgctt taataaaact ttatttacaa aaaaaaaaaa aaa 3145 

gcggccgctc ccggccggcc caagggacag agccaggctc cgggagcccg caacactcgt 60 
cctgagagcc ccggctcctc agcccgctac ggccagggcc tcggcctccg cccccgactc 120 
ccgagctcct gccctagagt cgactgggct cccgcccgcg tgggacagac agacggacag 180 
ccagccctgc gagggcgcgc ggaccgggcg gaggtgttgt aggaggagac cgaggagggg 240 
ggctgggctg gggctggggc cgcgccggca agagagacat gcgattggtg accaagccga 300 
gcggacggac agcgcgcccg ag atg cag gtg agc gag agg atg ctg gca ggg 352 

Met Gln Val Ser Glu Arg Met Leu Ala Gly 
1 5 10 



ggt atg aga agc atg ccc agc ccc ctc ctg gcc tgc tgg cag ccc atc 400 
Gly Met Arg Ser Met Pro Ser Pro Leu Leu Ala Cys Trp Gln Pro Ile 
15 20 25 

ctc ctg ctg gta ctg ggc tca gtg ctg tca ggc tct gct aca ggc tgc 448 
Leu Leu Leu Val Leu Gly Ser Val Leu Ser Gly Ser Ala Thr Gly Cys 
30 35 40 

ccg ccc cgc tgc gag tgc tca gcg cag gac cga gcc gtg ctc tgc cac 496 
Pro Pro Arg Cys Glu Cys Ser Ala Gln Asp Arg Ala Val Leu Cys His 
45 50 55 

cg? aaa cgc ttt gtg gcg gtg ccc gag ggc atc ccc acc gag act cgc 544 
Arg Lys Arg Phe Val Ala Val Pro Glu Gly Ile Pro Thr Glu Thr Arg 
60 65 70 

ctg ctg gac ctg ggc aaa aac cgc atc aag aca ctc aac cag gac gag 592 
Leu Leu Asp Leu Gly Lys Asn Arg Ile Lys Thr Leu Asn Gln Asp Glu 
75 80 85 90 

ttt gcc agc ttc cca cac ctg gag gag cta gaa ctc aat gaa aac atc 640 
Phe Ala Ser Phe Pro His Leu Glu Glu Leu Glu Leu Asn Glu Asn Ile 
95 100 105 

gt? agc gcc gtg gag cca ggc gcc ttc aac aac ctc ttc aac ctg agg 688 
Val Ser Ala Val Glu Pro Gly Ala Phe Asn Asn Leu Phe Asn Leu Arg 
110 115 120 

act ctg ggg ctg cgc agc aac cgc ctg aag ctt atc ccg ctg ggc gtc 736 
Thr Leu Gly Leu Arg Ser Asn Arg Leu Lys Leu Ile Pro Leu Gly Val 
125 130 135 

ttc acc ggc ctc agc aac ttg acc aag ctg gac atc agt gag aac aag 784 
Phe Thr Gly Leu Ser Asn Leu Thr Lys Leu Asp Ile Ser Glu Asn Lys 
140 145 150 

atc gtc atc ctg cta gac tac atg ttc caa gac cta tac aac ctc aag 832 
Ile Val Ile Leu Leu Asp Tyr Met Phe Gln Asp Leu Tyr Asn Leu Lys 
155 160 165 170 

tcg ctg gag gtc ggc gac aac gac ctc gtc tac atc tcc cat cga gcc 880 
Ser Leu Glu Val Gly Asp Asn Asp Leu Val Tyr Ile Ser His Arg Ala 
175 180 185 

ttc agc ggc ctc aac agc ctg gaa cag ctg acg ctg gag aaa tgc aat 928 
Phe Ser Gly Leu Asn Ser Leu Glu Gln Leu Thr Leu Glu Lys Cys Asn 
190 195 200 

ctg acc tcc atc ccc acg gag gcg ctc tcc cac ctg cac ggc ctc atc 976 
Leu Thr Ser Ile Pro Thr Glu Ala Leu Ser His Leu His Gly Leu Ile 
205 210 215 

gtc ctg cgg cta cga cat ctc aac atc aat gcc atc agg gac tac tcc 1024 
Val Leu Arg Leu Arg His Leu Asn Ile Asn Ala Ile Arg Asp Tyr Ser 
220 225 230 

ttc aag agg ctg tac cga ctt aag gtc tta gag atc tcc cac tgg ccc 1072 
Phe Lys Arg Leu Tyr Arg Leu Lys Val Leu Glu Ile Ser His Trp Pro 
235 240 245 250 

tac ctg gac acc ata acc ccc cgg acg cgt ggg tcg ac 1110 
Tyr Leu Asp Thr Ile Thr Pro Arg Thr Arg Gly Ser 
255 260 

ctcctggatg tgcgcagccg cagagcgctg ctgctgtgcc taatacccat cgctgcgcac 60 
ttgacagcca gtccgcccgt ccggagcccg gctcgttggg gcagc atg gcg ggg tcg 117 

Met Ala Gly Ser 
1 

ccg ctg ctc tgc ggg ccg cgg gcc ggg ggc gtc ggc att ttg gtg ctg 165 
Pro Leu Leu Cys Gly Pro Arg Ala Gly Gly Val Gly Ile Leu Val Leu 
5 10 15 20 

ctg ctc ttg ggc ctt ctg agg ctg ccc ccc acc ctg tca gcg agg ccc 213 
Leu Leu Leu Gly Leu Leu Arg Leu Pro Pro Thr Leu Ser Ala Arg Pro 
25 30 35 

gtg aag gag ccc cgc agt ctg agc gca gca tcc gcg ccc ttg gtt gag 261 
Val Lys Glu Pro Arg Ser Leu Ser Ala Ala Ser Ala Pro Leu Val Glu 
40 45 50 

acg agc act ccc ctc cgc ttg cgt cgg gcc gtg ccc cga gga gag gcg 309 
Thr Ser Thr Pro Leu Arg Leu Arg Arg Ala Val Pro Arg Gly Glu Ala 
55 60 65 

??? ggt gcg gtg cag gag ctg gcg cgg gcg ctg gcg cac ctg ctg gag 357 
Ala Gly Ala Val Gln Glu Leu Ala Arg Ala Leu Ala His Leu Leu Glu 
70 75 80 

gc? gag aga cag gaa cgc gcg cgt gct gag gcg cag gag gct gag gat 405 
Ala Glu Arg Gln Glu Arg Ala Arg Ala Glu Ala Gln Glu Ala Glu Asp 
85 90 95 100 

ca? cag gcg cgt gtc ctg gcg cag ctg ctg cgc gcc tgg ggc tct ccg 453 
Gln Gln Ala Arg Val Leu Ala Gln Leu Leu Arg Ala Trp Gly Ser Pro 
105 110 115 

cg? gcc tcg gac ccg ccc ttg gcc ccc gac gat gac ccg gac gct cca 501 
Arg Ala Ser Asp Pro Pro Leu Ala Pro Asp Asp Asp Pro Asp Ala Pro 
120 125 130 

gct gca cag ctc gcc cgt gct ctg ctc cga gct cgc cta gac ccc ggc 549 
Ala Ala Gln Leu Ala Arg Ala Leu Leu Arg Ala Arg Leu Asp Pro Gly 
135 140 145 

ccc cag tgt atg atg atg gcc cca ctg gcc cag acg tcg agg atg ccg 597 
Pro Gln Cys Met Met Met Ala Pro Leu Ala Gln Thr Ser Arg Met Pro 
150 155 160 

gcg acg aga ctc ctg acg tgg acc ctg agc tgc tgaggtactt gctagggcgg 650 
Ala Thr Arg Leu Leu Thr Trp Thr Leu Ser Cys 
165 170 175 

atcctcaccg gaagttcgga gccagaggct gctcctgccc cgcgccgcct ccgccgatct 710 
gtggaccagg atttgggtcc cgaggtgccc cctgagaacg tactgggggc tctgctacgc 770 
gtcaaacgcc tggagaaccc ctcgccccag gcgccggcac gccgcctcct gcctccctga 830 
gcgctgctgc atcctgcacg ccctggaacc caggagcgcc ccagcaaccc tgactccctg 890 
ccagcaggtc caaggctgct taccccagca acctcccatc ccctgagccc tcaataaatg 950 
ccatctgtag caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1010 
aaaaaaaaaa aaaaaaa 1027 